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G. SEQUENCE LISTING 



5 (1) GENERAL INFORMATION 

(i) APPLICANT; Darrall Anderson, Nabil Hann«, John Leonard, 

Roland Newman and Mitchell RefFand William H. 
Rastetter 

10 

(ii) TITLE OF INVENTION: THERAPEUTIC APPLICATION OF 

CHIMERIC AND RADIOLABELED 
ANTIBODIES TO HUMAN B 
LYMPHOCYTE RESTRICTED 
15 DIFFERENTIATION ANTIGEN FOR 

TREATMENT OF B CELL LYMPHOMA 

(iii) NUMBER OF SEQUENCES: 8 
20 (iv) CORRESPONDING ADDRESS: 

(A) ADDRESSEE: IDEC Pharmaceuticals Corporation 

(B) STREET: 11011 Torreyana Road 

(C) CITY: San Diego 
25 (D) STATE: California 

(E) COUNTRY: USA 

(F) ZIP: 92121 

(v) COMPUTER READABLE FORM: 

30 

(A) MEDIUM TYPE: Diskette, 3.5 inch, 1.44 Mb 

(B) COMPUTER: Macintosh 

(C) OPERATING SYSTEM: MS-DOS 

(D) SOFTWARE: Microsoft Word 5.0 

35 

(vi CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

40 (C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Burgoon, Richard P. Jr. 
45 (B) REGISTRATION NUMBER: 34,787 

(C) REFERENCE/DOCKET NUMBER: 

(ix) TELECOMMUNICATION INFORMATION: 



50 



(A) TELEPHONE: (619)550-8500 

(B) TELEFAX: (619)550-8750 



( 
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(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8540 bases 

(B) TYPE: iroclaigaoid 

(C) STRANDEDNBSS: single 

(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: yes 
15 (iv) ANTI-SENSE: no 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

^ GACGTCGCGG CCGCTCTAGG CCTCCAAAAA AGCCTCCTCA CTACTTCTGG AATAGCTCAG 60 

AGGCCGAGGC GGCCTCGGCC TCTGCATAAA TAAAAAAAAT TAGTCAGCCA TGCATGGGGC 120 

GGAGAATGGG CGGAACTGGG CGGAGTTAGG GGCGGGATGG GCGGAGTTAG GGGCGGGACT 180 

25 ATGGTTGCTG ACTAATTGAG ATGCATGCTT TGCATACTTC TGCCTGCTGG GGAGCCTGGG 240 

GACTTTCCAC ACCTGGTTGC TGACTAATTG AGATGCATGC TTTGCATACT TCTGCCTGCT 300 

^ GGGGAGCCTG GGGACTTTCC ACACCCTAAC TGACACACAT TCCACAGAAT TAATTCC CCT 360 

AGTTATTAAT AGTAATCAAT TACGGGGTCA TTAGTTCATA GCCCATATAT GGAGTTCCGC 420 

GTTACATAAC TTACGGTAAA TGGCCCGCCT GGCTGACCGC CCAACGACCC CCGCCCATTG 480 

35 ACGTCAATAA TGACGTATGT TCCCATAGTA ACGCCAATAG GGAC TTTCC A TTGACGTCAA 540 

TGGGTGGACT ATTTACGGTA AACTGCCCAC TTGGCAGTAC ATCAAGTGTA TCATATGCCA 600 

AGTACGCCCC CTATTGACGT CAATGACGGT AAATGGCCCG CCTGGCATTA TGCCCAGTAC 660 

40 

ATGACCTTAT GGGACTTTCC TACTTGGCAG TACATCTACG TATTAGTCAT CGCTATTACC 720 

ATGGTGATGC GGTTTTGGCA GTACATCAAT GGG CGTGG AT AGCGGTTTGA CTCACGGGGA 780 

45 TTTCCAAGTC TCCACCCCAT TGACGTCAAT GGGAGTTTGT TTTGGCACCA AAATCAACGG 840 

GACTTTCCAA AATGTCGTAA CAACTCCGCC CCATTGACGC AAATGGGCGG TAGGCGTGTA 900 

CGGTGGGAGG TCTATATAAG CAGAGCTGGG TACGTGAACC GTCAGATCGC CTGGAGACGC 960 

50 

CATCACAGAT CTCTCACCAT GAGGGTCCCC GCTCAGCTCC TGGGGCTCCT GCTGCTCTGG 1020 

CTCCCAGGTG CACGATGTGA TGGTACCAAG GTGGAAATCA AACGTACGGT GGCTGCACCA 1080 

55 TCTGTCTTCA TCTTCCCGCC ATCTGATGAG CAGTTGAAAT CTGGAACTGC CTCTGTTGTG 1140 

TGCCTGCTGA ATAACTTCTA TCCCAGAGAG GCCAAAGTAC AGTGGAAGGT GGATAACGCC 1200 

CTCCAATCGG GTAACTCCCA GGAGAGTGTC ACAGAGCAGG ACAGCAAGGA CAGCACCTAC 1260 



-64- 



AGCCTCAGCA GCACCCTGAC GCTGAGCAAA 
TGCGAAGTCA CCCATCAGGG CCTGAGCTCG 
5 TGTTGAATTC AGATCCGTTA ACGGTTACCA 

GGCCGTGATA TCTACGTATG ATCAGCCTCG 
GTTTGCCCCT C CCCCGTGCC TTCCTTGACC 

10 

TAATAAAATG AGGAAATTGC ATCGCATTGT 
GGGGTGGGGC AGGACAGCAA GGGGGAGGAT 
15 GCGGTGGGCT CTATGGAACC AGCTGGGGCT 

ACGTCAATGA CGGTAAATGG CCCGCCTGGC 
TTCCTACTTG GCAGTACATC TACGTATTAG 

20 

GGCAGTACAT CAATGGGCGT GGATAGCGGT 
CCATTGACGT CAATGGGAGT TTGTTTTGGC 
25 GTAACAACTC CGCCCCATTG ACGCAAATGG 

TAAGCAGAGC TGGGTACGTC CTCACATTCA 
ATGGGTTGGA GCCTCATCTT GCTCTTCCTT 

30 

AAGGGCCCAT CGGTCTTCCC CCTGGCACCC 
GCCCTGGGCT GCCTGGTCAA GGACTACTTC 
35 GGCGCCCTGA CCAGCGGCGT GCACACCTTC 

TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC 
AACGTGAATC ACAAGCCCAG CAACACCAAG 

40 

GACAAAACTC ACACATGCCC ACCGTGCCCA 
TTCCTCTTCC CCCCAAAACC CAAGGACACC 
45 TGCGTGGTGG TGGACGTGAG CCACGAAGAC 

GGCGTGGAGG TGCATAATGC CAAGACAAAG 
CGTGTGGTCA GCGTCCTCAC CGTCCTGCAC 

50 

TGCAAGGTCT CCAACAAAGC CCTCCCAGCC 
GGGCAGCCCC GAGAACCACA GGTGTACACC 
55 AACCAGGTCA GCCTGACCTG CCTGGTCAAA 

TGGGAGAGCA ATGGGCAGCC GGAGAACAAC 
GACGGCTCCT TCTTCCTCTA CAGCAAGCTC 

60 

AACGTCTTCT CATGCTCCGT GATGCATGAG 
CTCTCCCTGT CTCCGGGTAA ATGAGGATCC 



( 

GCAGACTACG AGAAACACAA AGTCTACGCC 132 0 

CCCGTCACAA AGAGCTTCAA CAGGGGAGAG 1380 

ACTACCTAGA CTGGATTCGT GACAACATGC 1440 

ACTGTGCCTT CTAGTTGCCA GCCATCTGTT 1500 

CTGGAAGGTG CCACTCCCAC TGTCCTTTCC 1560 

CTGAGTAGGT GTCATTCTAT TCTGGGGGGT 1620 

TGGGAAGACA ATAGCAGGCA TGCTGGGGAT 1680 

CGACAGCTAT GCCAAGTACG CCCCCTATTG 1740 

ATTATGCCCA GTACATGACC TTATGGGACT 1800 

TCATCGCTAT TACCATGGTG ATGCGG TTTT 1860 

TTGACTCACG GGGATTTCCA AGTCTCCACC 1920 

ACCAAAATCA ACGGGACTTT CCAAAATGTC 1980 

GCGGTAGGCG TGTACGGTGG GAGGTCTATA 204 0 

GTGATCAGCA CTGAACACAG ACCCGTCGAC 2100 

GTCGCTGTTG CTACGCGTGT CGCTAGCACC 2160 

TCCTCCAAGA GCACCTCTGG GGGCACAGCG 2220 

CCCGAACCGG TGACGGTGTC GTGGAACTCA 22 80 

CCGGCTGTCC TACAGTCCTC AGGACTCTAC 23 40 

AGCAGCTTGG GCACCCAGAC CTACATCTGC 2 400 

GTGGACAAGA AAGCAGAGCC CAAATCTTGT 2460 

GCACCTGAAC TCCTGGGGGG ACCGTCAGTC 2520 

CTCATGATCT CCCGGACCCC TGAGGTCACA 2580 

CCTGAGGTCA AGTTCAACTG GTACGTGGAC 2640 

CCGCGGGAGG AGCAGTACAA CAGCACGTAC 27 00 

CAGGACTGGC TGAATGGCAA GGAGTACAAG 2760 

CCCATCGAGA AAACCATCTC CAAAGCCAAA 2 820 

CTGCCCCCAT CCCGGGATGA GCTGACCAAG 2 8 80 

GGCTTCTATC CC AGCGAC AT CGCCGTGGAG 2940 

TACAAGACCA CGCCTCCCGT GCTGGACTCC 3000 

ACCGTGGACA AGAGCAGGTG GCAGCAGGGG 30 60 

GCTCTGCACA ACCACTACAC GCAGAAGAGC 3120 

GTTAACGGTT ACCAACTACC TAGACTGGAT 3180 
-65- 
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TCGTGACAAC ATGCGGCCGT GATATCTACG TATGATCAGC CTCGACTGTG CCTTCTAGTT 3240 
GCCAGCCATC TGTTGTTTGC CCCTCCCCCG TGCCTTCCTT GACCCTGGAA GGTGCCACTC 3300 
CCACTGTCCT TTCCTAATAA AATGAGGAAA TTGCATCGCA TTGTCTGAGT AGGTGTCATT 3360 
BTA^WtacM ac«fciHciia*o WtttaPfuac^em «DAArtwou«A dOAWcmaAA. oacaatauKA ins 

GGCATGCTGG GGATGCGGTG GGCTCTATGG AACCAGCTGG GGCTCGACAG CGCTGGATCT 3480 
CCCGATCCCC AGCTTTGCTT CTCAATTTCT TATTTGCATA ATGAGAAAAA AAGGAAAATT 3540 
AATTTTAACA CCAATTCAGT AGTTGATTGA GCAAATGCGT TGCCAAAAAG GATGCTTTAG 3600 

AGACAGTGTT CTCTGCACAG ATAAGGACAA ACATTATTCA GAGGGAGTAC CCAGAGCTGA 3660 

GACTCCTAAG CCAGTGAGTG GCACAGCATT CTAGGGAGAA ATATGCTTGT CATCACCGAA 3720 

GCCTGATTCC GTAGAGCCAC ACCTTGGTAA GGGCCAATCT GCTCACACAG GATAGAGAGG 37 80 

GCAGGAGCCA GGGCAGAGCA TATAAGGTGA GGTAGGATCA GTTGCTCCTC ACATTTGCTT 3840 

CTGACATAGT TGTGTTGGGA GCTTGGATAG CTTGGACAGC TCAGGGCTGC GATTTCGCGC 3900 

CAAACTTGAC GGCAATCCTA GCGTGAAGGC TGGTAGGATT TTATCCCCGC TGCCATCATG 3960 

GTTCGACCAT TGAACTGCAT CGTCGCCGTG TCCCAAAATA TGGGGATTGG CAAGAACGGA 4020 

GACCTACCCT GGCCTCCGCT CAGGAACGAG TTCAAGTACT TCCAAAGAAT GACCACAACC 4080 

TCTTCAGTGG AAGGTAAACA GAATCTGGTG ATTATGGGTA GGAAAACCTG GTTCTCCATT 4140 

CCTGAGAAGA ATCGACCTTT AAAGGACAGA ATTAATATAG TTCTCAGTAG AGAACTCAAA 4200 

GAACCACCAC GAGGAGCTCA TTTTCTTGCC AAAAGTTTGG ATGATGCCTT AAGACTTATT 42 6 C 

GAACAACCGG AATTGGCAAG TAAAGTAGAC ATGGTTTGGA TAGTCGGAGG CAGTTC TGTT 4320 

TACCAGGAAG CCATGAATCA ACCAGGCCAC CTTAGACTCT TTGTGACAAG GATCATGCAG 4380 

GAATTTGAAA GTGACACGTT TTTCCCAGAA ATTGATTTGG GGAAATATAA ACTTCTCCCA 4440 

GAATACCCAG GCGTCCTCTC TGAGGTCCAG GAGGAAAAAG GCATCAAGTA TAAGTTTGAA 4500 

GTCTACGAGA AGAAAGACTA ACAGGAAGAT GCTTTCAAGT TCTCTGCTCC CCTCCTAAAG 4560 

CTATGCATTT TTATAAGACC ATGGGACTTT TGCTGGCTTT AGATCAGCCT CGACTGTGCC 4620 

TTCTAGTTGC CAGCCATCTG TTGTTTGCCC CTCCCCCGTG CCTTCCTTGA CCCTGGAAGG 4680 

TGCCACTCCC ACTGTCC TTT CCTAATAAAA TGAGGAAATT GCATCGCATT GTC TGAGTAG 4740 

GTGTCATTCT ATTCTGGGGG GTGGGGTGGG GCAGGACAGC AAGGGGGAGG ATTGGGAAGA 4 800 

CAATAGCAGG CATGCTGGGG ATGCGGTGGG CTCTATGGAA CCAGCTGGGG CTCGAGCTAC 4860 

TAGCTTTGCT TCTCAATTTC TTATTTGCAT AATGAGAAAA AAAGGAAAAT TAATTTTAAC 4920 

ACCAATTCAG TAGTTGATTG AGC AAATGCG TTGC C AAAAA GGATGCTTTA GAGACAGTGT 4980 

TCTCTGCACA GATAAGGACA AACATTATTC AGAGGGAGTA CCCAGAGCTG AGACTCCTAA 5040 
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GCCAGTGAGT GGCACAGCAT TCTAGGGAGA AATATGCTTG TCATCACCGA AGCCTGATTC 5100 

CGTAGAGCCA C AC CTTGGTA AGGGCCAATC TGCTCACACA GGATAGAGAG GGCAGGAGCC 5160 

5 AGGGCAGAGC ATATAAGGTG AGGTAGGATC AGTTGCTCCT CACATTTGCT TCTGACATAG 5220 

TTGTGTTGGG AGCTTGGATC GATCCTCTAT GGTTGAACAA GATGGATTGC ACGCAGGTTC 5280 

TCCQGCCGCT TOCCTOQAOA QGCTATTCQO eTATQACTOO GCACAACAOA CAATCOQCTQ 33 40 

10 

CTCTGATGCC GCCGTGTTCC GGCTGTCAGC GCAGGGGCGC CCGGTTCTTT TTGTCAAGAC 5400 

CGACCTGTCC GGTGCCCTGA ATGAACTGCA GGACGAGGCA GCGCGGCTAT CGTGGCTGGC 5460 

15 CACGACGGGC GTTCCTTGCG CAGCTGTGCT CGACGTTGTC ACTGAAGCGG GAAGGGACTG 5520 

GCTGCTATTG GGCGAAGTGC CGGGGCAGGA TCTCCTGTCA TCTCACCTTG CTCCTGCCGA 5580 

GAAAGTATCC ATCATGGCTG ATGCAATGCG GCGGCTGCAT ACGCTTGATC CGGCTACCTG 5640 

20 

CCCATTCGAC CACCAAGCGA AACATCGCAT CGAGCGAGCA CGTACTCGGA TGGAAGCCGG 5700 

TCTTGTCGAT CAGGATGATC TGGACGAAGA GCATCAGGGG CTCGCGCCAG CCGAACTGTT 57 60 

25 CGCCAGGCTC AAGGCGCGCA TGCCCGACGG CGAGGATCTC GTCGTGACCC ATGGCGATGC 5820 

CTGCTTGCCG AATATCATGG TGGAAAATGG CCGCTTTTCT GGATTCATCG ACTGTGGCCG 5880 

GCTGGGTGTG GCGGACCGCT ATCAGGACAT AGCGTTGGCT ACCCGTGATA TTGCTGAAGA 5940 

30 

GCTTGGCGGC GAATGGGCTG ACCGCTTCCT CGTGCTTTAC GGTATCGCCG CTCCCGATTC 6000 

GCAGCGCATC GCCTTCTATC GCCTTCTTGA CGAGTTCTTC TGAGCGGGAC TCTGGGGTTC 6060 

35 GAAATGACCG ACCAAGCGAC GCCCAACCTG CCATCACGAG ATTTCGATTC CACCGCCGCC 6120 

TTCTATGAAA GGTTGGGCTT CGGAATCGTT TTCCGGGACG CCGGCTGGAT GATCCTCCAG 6180 

CGCGGGGATC TCATGCTGGA GTTCTTCGCC CACCCCAACT TGTTTATTGC AGCTTATAAT 6240 

40 

GGTTACAAAT AAAGCAATAG CATCACAAAT TTCACAAATA AAGCATTTTT TTCACTGCAT 6300 

TCTAGTTGTG GTTTGTCCAA ACTCATCAAT CTATCTTATC ATGTCTGGAT CGCGGCCGCG 6360 

45 ATCCCGTCGA GAGCTTGGCG TAATCATGGT CATAGCTGTT TCCTGTGTGA AATTGTTATC 6420 

CGCTCACAAT TCCACACAAC ATACGAGCCG GAAGCATAAA GTGTAAAGCC TGGGGTGCCT 6480 

AATGAGTGAG CTAACTCACA TTAATTGCGT TGCGCTCACT GCCCGCTTTC CAGTCGGGAA 6540 

50 

ACCTGTCGTG CCAGCTGCAT TAATGAATCG GCCAACGCGC GGGGAGAGGC GGTTTGCGTA 6600 

TTGGGCGCTC TTCCGCTTCC TCGCTCACTG ACTCGCTGCG CTCGGTCGTT CGGCTGCGGC 6660 

55 GAGCGGTATC AGCTCACTCA AAGGCGGTAA TACGGTTATC CACAGAATCA GGGGATAACG 6720 

CAGGAAAGAA CATGTGAGCA AAAGGCCAGC AAAAGGC C AG GAACCGTAAA AAGGCCGCGT 6780 

TGCTGGCGTT TTTCCATAGG CTCCGCCCCC CTGACGAGCA TCACAAAAAT CGACGCTCAA 6840 

GTCAGAGGTG GCGAAACCCG ACAGGACTAT AAAGATACCA GGCGTTTCCC CCTGGAAGCT 6900 

CCCTCGTGCG CTCTCCTGTT CCGACCCTGC CGCTTACCGG ATACCTGTCC GCCTTTCTCC 6960 
-67- 



60 



20 



55 



CTTCGGGAAG 


CGTGGCGCTT 


TCTCAATGCT 


CACGCTGTAG 


GTATCTCAGT 


TCGGTGTAGG 


7020 


TCGTTCGCTC 


CAAGCTGGGC 


TGTGTGCACG 


AACCCCCCGT 


TCAGCCCGAC 


CGCTGCGCCT 


7080 


TATCCGGTAA 


CTATCGTCTT 


GAGTCCAACC 


CGGTAAGACA 


CGACTTATCG 


C CACTGGC AG 


7140 


CAOCC&CTOQ 


TAACAOQATT 




QOTATC1T&Q0 


CQQTQQT&CA 


CAOTTCTTOa 


7300 


AGTGGTGGCC 


TAACTACGGC 


TACACTAGAA 


GGACAGTATT 


TGGTATCTGC 


GCTCTGCTGA 


72S0 


AGCCAGTTAC 


CTTCGGAAAA 


AGAGTTGGTA 


GCTCTTGATC 


CGGCAAACAA 


ACCACCGCTG 


7320 


GTAGCGGTGG 


TTTTTTTGTT 


TGCAAGCAGC 


AGATTACGCG 


CAGAAAAAAA 


GGATCTCAAG 


7380 


AAGATCCTTT 


GATCTTTTCT 


ACGGGGTCTG 


ACGCTCAGTG 


GAACGAAAAC 


TCACGTTAAG 


7440 


GGATTTTGGT 


CATGAGATTA 


TCAAAAAGGA 


TCTTCACCTA 


GATCCTTTTA 


AATTAAAAAT 


7500 


GAAGTTTTAA 


ATCAATCTAA 


AGTATATATG 


AGTAAACTTG 


GTCTGACAGT 


TACCAATGCT 


7560 


TAATCAGTGA 


GGCACCTATC 


TCAGCGATCT 


GTCTATTTCG 


TTCATCCATA 


GTTGCCTGAC 


7620 


TCCCCGTCGT 


GTAGATAACT 


ACGATACGGG 


AGGGCTTACC 


ATCTGGCCCC 


AGTGCTGCAA 


7680 


TGATACCGCG 


AGACCCACGC 


TCACCGGCTC 


CAGATTTATC 


AGCAATAAAC 


CAGCCAGCCG 


7740 


GAAGGGCCGA 


GCGCAGAAGT 


GGTCCTGCAA 


CTTTATCCGC 


CTCCATCCAG 


TCTATTAATT 


7800 


GTTGCCGGGA 


AGCTAGAGTA 


AGTAGTTCGC 


CAGTTAATAG 


TTTGCGCAAC 


GTTGTTGCCA 


7860 


TTGCTACAGG 


CATCGTGGTG 


TCACGCTCGT 


CGTTTGGTAT 


GGCTTCATTC 


AGCTCCGGTT 


7920 


CCCAACGATC 


AAGGCGAGTT 


ACATGATCCC 


CCATGTTGTG 


CAAAAAAGCG 


GTTAGCTCCT 


7980 


TCGGTCCTCC 


GATCGTTGTC 


AGAAGTAAGT 


TGGCCGCAGT 


GTTATCACTC 


ATGGTTATGG 


8040 


CAGCACTGCA 


TAATTCTCTT 


ACTGTCATGC 


CATCCGTAAG 


ATGCTTTTCT 


GTG AC TGGTG 


8100 


AGTACTCAAC 


CAAGTCATTC 


TGAGAATAGT 


GTATGCGGCG 


ACCGAGTTGC 


TCTTGCCCGG 


8160 


CGTCAATACG 


GGATAATACC 


GCGCCACATA 


GCAGAACTTT 


AAAAGTGCTC 


ATCATTGGAA 


8220 


AACGTTCTTC 


GGGGCGAAAA 


CTCTCAAGGA 


TCTTACCGCT 


GTTGAGATCC 


AGTTCGATGT 


8280 


AACCCACTCG 


TGCACCCAAC 


TGATCTTCAG 


CATCTTTTAC 


TTTCACCAGC 


GTTTCTGGGT 


8340 


GAGCAAAAAC 


AGGAAGGCAA 


AATGCCGCAA 


AAAAGGGAAT 


AAGGGCGACA 


CGGAAATGTT 


8400 


GAATACTCAT 


ACTCTTCCTT 


TTTCAATATT 


ATTGAAGCAT 


TTATCAGGGT 


TATTGTCTCA 


8460 


TGAGCGGATA 


CATATTTGAA 


TGTATTTAGA 


AAAATAAACA 


AATAGGGGTT 


CCGCGCACAT 


8520 


TTCCCCGAAA 


AGTGCCACCT 
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(3) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 9209 bases 
-68- 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: circular 

5 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: yen 

(iv) ANTI-SENSE: no 

10 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

GACGTCGCGG CCGCTCTAGG CCTCCAAAAA AGCCTCCTCA CTACTTCTGG AATAGCTCAG 60 

15 

AGGCCGAGGC GGCCTCGGCC TCTGCATAAA TAAAAAAAAT TAGTCAGCCA TGCATGGGGC 120 

GGAGAATGGG CGGAACTGGG CGGAGTTAGG GGCGGGATGG GCGGAGTTAG GGGCGGGACT 180 

20 ATGGTTGCTG ACTAATTGAG ATGCATGCTT TGCATACTTC TGCCTGCTGG GGAGCCTGGG 240 

GACTTTCCAC ACCTGGTTGC TGACTAATTG AGATGCATGC TTTGCATACT TCTGCCTGCT 3 00 

GGGGAGCCTG GGGACTTTCC ACACCCTAAC TGACACACAT TCCACAGAAT TAATTCCCCT 3 60 

25 

AGTTATTAAT AGTAATCAAT TACGGGGTCA TTAGTTCATA GCCCATATAT GGAGTTCCGC 420 

GTTACATAAC TTACGGTAAA TGGCCCGCCT GGCTGACCGC CCAACGACCC CCGCCCATTG 480 

30 ACGTCAATAA TGACGTATGT TCCCATAGTA ACGCCAATAG GGACTTTCCA TTGACGTCAA 540 

TGGGTGGACT ATTTACGGTA AACTGCCCAC TTGGCAGTAC ATCAAGTGTA TCATATGCCA 600 

^ AGTACGCCCC CTATTGACGT CAATGACGGT AAATGGCCCG CCTGGCATTA TGCCCAGTAC 660 

ATGACCTTAT GGGACTTTCC TACTTGGCAG TACATCTACG TATTAGTCAT CGCTATTACC 720 

ATGGTGATGC GGTTTTGGCA GTACATCAAT GGGCGTGGAT AGCGGTTTGA CTCACGGGGA 780 

40 TTTCCAAGTC TCCACCCCAT TGACGTCAAT GGGAGTTTGT TTTGGCACCA AAATCAACGG 840 

GACTTTCCAA AATGTCGTAA CAACTCCGCC CCATTGACGC AAATGGGCGG TAGGCGTGTA 900 

CGGTGGGAGG TCTATATAAG CAGAGCTGGG TACGTGAACC GTCAGATCGC CTGGAGACGC 960 

45 

CATCACAGAT CTCTCACTAT GGATTTTCAG GTGCAGATTA TCAGCTTCCT GCTAATCAGT 1020 

GCTTCAGTCA TAATGTCCAG AGGACAAATT GTTCTCTCCC AGTCTCCAGC AATCCTGTCT 1080 

50 GCATCTCCAG GGGAGAAGGT CACAATGACT TGCAGGGCCA GCTCAAGTGT AAGTTACATC 1140 

CACTGGTTCC AGCAGAAGCC AGGATCCTCC CCCAAACCCT GGATTTATGC CACATCCAAC 1200 

CTGGCTTCTG GAGTCCCTGT TCGCTTCAGT GGCAGTGGGT CTGGGACTTC TTACTCTCTC 1260 

55 

ACAATCAGCA GAGTGGAGGC TGAAGATGCT GCCACTTATT ACTGCCAGCA GTGGACTAGT 1320 

AACCCACCCA CGTTCGGAGG GGGGACCAAG CTGGAAATCA AACGTACGGT GGCTGCACCA 13 80 

60 TCTGTCTTCA TCTTCCCGCC ATCTGATGAG CAGTTGAAAT CTGGAACTGC CTCTGTTGTG 1440 



( 



TGCCTGCTGA ATAACTTCTA TCCCAGAGAG 
CTCCAATCGG GTAACTCCCA GGAGAGTGTC 
5 AGCCTCAGCA GCACCCTGAC GCTGAGCAAA 

TGCGAAGTCA CCCATCAGGG CCTGAGCTCG 
TGTTGAATTC AGATCCGTTA ACGGTTACCA 

10 

GGCCGTGATA TCTACGTATG ATCAGCCTCG 
GTTTGCCCCT CCCCCGTGCC TTCCTTGACC 
15 TAATAAAATG AGGAAATTGC ATCGCATTGT 

GGGGTGGGGC AGGACAGCAA GGGGGAGGAT 
GCGGTGGGCT CTATGGAACC AGCTGGGGCT 

20 

ACGTCAATGA CGGTAAATGG CCCGCCTGGC 
TTCCTACTTG GCAGTACATC TACGTATTAG 
25 GGCAGTACAT CAATGGGCGT GGATAGCGGT 

CCATTGACGT CAATGGGAGT TTGTTTTGGC 
GTAACAACTC CGCCCCATTG ACG CAAATGG 

30 

TAAGCAGAGC TGGGTACGTC CTCACATTCA 
ATGGGTTGGA GCCTCATCTT GCTCTTCCTT 
35 GTACAACTGC AGCAGCCTGG GGC TGAGCTG 

TGCAAGGCTT CTGGCTACAC ATTTACCAGT 
GGTCGGGGCC TGGAATGGAT TGGAGCTATT 

40 

CAGAAGTTCA AAGGCAAGGC CACATTGACT 
CAGCTCAGCA GCCTGACATC TGAGGACTCT 
45 TACGGCGGTG ACTGGTACTT CAATGTCTGG 

GCTAGCACCA AGGGCCCATC GGTCTTCCCC 
GGCACAGCGG CCCTGGGCTG CCTGGTCAAG 

50 

TGGAACTCAG GCGCCCTGAC CAGCGGCGTG 
GGACTCTACT CCCTCAGCAG CGTGGTGACC 
55 TACATCTGCA ACGTGAATCA CAAGCCCAGC 

AAATCTTGTG ACAAAACTCA CACATGCCCA 
CCGTCAGTCT TCCTCTTCCC CCCAAAACCC 

60 

GAGGTCACAT GCGTGGTGGT GGACGTGAGC 
TACGTGGACG GCGTGGAGGT GCATAATGCC 



GCCAAAGTAC AGTGGAAGGT GGATAACGCC 1500 

ACAGAGCAGG ACAGCAAGGA CAGCACCTAC 1560 

GCAGACTACG AGAAACACAA AGTCTACGCC 1620 

CCCGTCACAA AGAGCTTCAA CAGGGGAGAG 1680 

ACTACCTAGA CTGGATTCGT GACAACATGC 1740 

ACTGTGCCTT CTAGTTGCCA GCCATCTGTT 1800 

CTGGAAGGTG CCACTCCCAC TGTCCTTTCC 1860 

CTGAGTAGGT GTCATTCTAT TCTGGGGGGT 1920 

TGGGAAGACA ATAGCAGGCA TGCTGGGGAT 1980 

CGACAGCTAT GCCAAGTACG CCCCCTATTG 2040 

ATTATGCCCA GTACATGACC TTATGGGACT 2100 

TCATCGCTAT TACCATGGTG ATGCGGTTTT 2160 

TTGACTCACG GGGATTTCCA AGTCTCCACC 2220 

ACCAAAATCA ACGGGACTTT CCAAAATGTC 2280 

GCGGTAGGCG TGTACGGTGG GAGGTCTATA 2340 

GTGATCAGCA CTGAACACAG ACCCGTCGAC 2400 

GTCGCTGTTG CTACGCGTGT CCTGTCCCAG 2460 

GTGAAGCCTG GGGCCTCAGT GAAGATGTCC 2520 

TACAATATGC ACTGGGTAAA ACAGACACCT 2580 

TATCCCGGAA ATGGTGATAC TTCCTACAAT 2640 

GCAGACAAAT CCTCCAGCAC AGCCTACATG 2700 

GCGGTCTATT ACTGTGCAAG ATCGACTTAC 2760 

GGCGCAGGGA CCACGGTCAC CGTCTCTGCA 2820 

CTGGCACCCT CCTCCAAGAG CACCTCTGGG 2880 

GACTACTTCC CCGAACCGGT GACGGTGTCG 2940 

CACACCTTCC CGGCTGTCCT ACAGTCCTCA 3000 

GTGCCCTCCA GCAGCTTGGG CACCCAGACC 3060 

AACACCAAGG TGGACAAGAA AGCAGAGCCC 3120 

CCGTGCCCAG CACCTGAACT CCTGGGGGGA 3130 

AAGGACACCC TCATGATCTC CCGGACCCCT 3240 

CACGAAGACC CTGAGGTCAA GTTCAACTGG 3300 

AAGACAAAGC CGCGGGAGGA GCAGTACAAC 3360 
-70- 
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AGCACGTACC 
GAGTACAAGT 
AAAGCCAAAG 
CTGACCAAC3A 

GCCGTGGAGT 
CTGGACTCCG 
CAGCAGGGGA 
CAGAAGAGCC 
AGACTGGATT 
CTTCTAGTTG 
GTGCCACTCC 
GGTGTCATTC 
ACAATAGCAG 
GCTGGATCTC 
AGGAAAATTA 
ATGCTTTAGA 
CAGAGCTGAG 
ATCACCGAAG 
ATAGAGAGGG 
CATTTGCTTC 
ATTTCGCGCC 
GCCATCATGG 
AAGAACGGAG 
ACCACAACCT 
TTCTCCATTC 
GAACTCAAAG 
AGACTTATTG 
AGTTCTGTTT 
ATCATGCAGG 
CTTCTCCCAG 
AAGTTTGAAG 



GTGTGGTCAG 
GCAAGGTCTC 
GGCAGCCCCG 
ACCAC3QTCAQ 

GGGAGAGCAA 
ACGGCTCCTT 
ACGTCTTCTC 
TCTCCCTGTC 
CGTGACAACA 
CCAGCCATCT 
CACTGTCCTT 
TATTCTGGGG 
GCATGCTGGG 
CCGATCCCCA 
ATTTTAACAC 
GACAGTGTTC 
ACTCCTAAGC 
CCTGATTCCG 
CAGGAGCCAG 
TGACATAGTT 
AAACTTGACG 
TTCGACCATT 
ACCTACCCTG 
CTTCAGTGGA 
CTGAGAAGAA 
AACCACCACG 
AACAACCGGA 
ACCAGGAAGC 
AATTTGAAAG 
AATACCCAGG 
TCTACGAGAA 



CGTCCTCACC 
CAACAAAGCC 
AGAACCACAG 

TGGGCAGCCG 
CTTCCTCTAC 
ATGCTCCGTG 
TCCGGGTAAA 
TGCGGCCGTG 
GTTGTTTGCC 
TCCTAATAAA 
GGTGGGGTGG 
GATGCGGTGG 
GCTTTGCTTC 
CAATTCAGTA 
TCTGCACAGA 
CAGTGAGTGG 
TAGAGCCACA 
GGCAGAGCAT 
GTGTTGGGAG 
GCAATCCTAG 
GAACTGCATC 
GCCTCCGCTC 
AGGTAAACAG 
TCGACCTTTA 
AGGAGCTCAT 
ATTGGCAAGT 
CATGAATCAA 
TGACACGTTT 
CGTCCTCTCT 
GAAAGACTAA 



GTCCTGCACC 
CTCCCAGCCC 
GTGTACACCC 
CTQOTCAAAO 

GAGAACAACT 
AGCAAGCTCA 
ATGCATGAGG 
TGAGGATCCG 
ATATCTACGT 
CCTCCCCCGT 
ATGAGGAAAT 
GGCAGGACAG 
GCTCTATGGA 
TCAATTTCTT 
GTTGATTGAG 
TAAGGACAAA 
CACAGCATTC 
CCTTGGTAAG 
ATAAGGTGAG 
CTTGGATAGC 
CGTGAAGGCT 
GTCGCCGTGT 
AGGAACGAGT 
AATCTGGTGA 
AAGGACAGAA 
TTTCTTGCCA 
AAAGTAGACA 
CCAGGCCACC 
TTCCCAGAAA 
GAGGTCCAGG 
CAGGAAGATG 



AGGACTGGCT 
CCATCGAGAA 
TGCCCCCATC 
OCTTCTATCC 
ACAAGACCAC 
CCGTGGACAA 
CTCTGCACAA 
TTAACGGTTA 
ATGATCAGCC 
GCCTTCCTTG 
TGCATCGCAT 
CAAGGGGGAG 
ACCAGCTGGG 
ATTTGCATAA 
CAAATGCGTT 
CATTATTCAG 
TAGGGAGAAA 
GGCCAATCTG 
GTAGGATCAG 
TTGGACAGCT 
GGTAGGATTT 
CCCAAAATAT 
TCAAGTACTT 
TTATGGGTAG 
TTAATATAGT 
AAAGTTTGGA 
TGGTTTGGAT 
TTAGACTCTT 
TTGATTTGGG 
AGGAAAAAGG 
CTTTCAAGTT 



GAATGGCAAG 
AACCATCTCC 
CCGGGATGAG 
CAOCOAOA'TC 
GCCTCCCGTG 
GAGCAGGTGG 
CCACTACACG 
CCAACTACCT 
TCGACTGTGC 
ACCCTGGAAG 
TGTCTGAGTA 
GATTGGGAAG 
GCTCGACAGC 
TGAGAAAAAA 
GCCAAAAAGG 
AGGGAGTACC 
TATGCTTGTC 
CTCACACAGG 
TTGCTCCTCA 
CAGGGCTGCG 
TATCCCCGCT 
GGGGATTGGC 
CCAAAGAATG 
GAAAAC CTGG 
TCTCAGTAGA 
TGATGCCTTA 
AGTCGGAGGC 
TGTGACAAGG 
GAAATATAAA 
CATCAAGTAT 
CTCTGCTCCC 



3420 

3480 

3540 

3sao 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 
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CTCCTAAAGC 
GACTGTGCCT 
CCTGGAAGGT 
TCTGAGTAGG 
TTCGGAAOAC 
TCGAGCTACT 
AATTTTAACA 
AGACAGTGTT 
GACTCCTAAG 
GCCTGATTCC 
GCAGGAGCCA 
CTGACATAGT 
CGCAGGTTCT 
AATCGGCTGC 
TGTCAAGACC 
GTGGCTGGCC 
AAGGGACTGG 
TCCTGCCGAG 
GGCTACCTGC 
GGAAGCCGGT 
CGAACTGTTC 
TGGCGATGCC 
CTGTGGCCGG 
TGCTGAAGAG 
TCCCGATTCG 
CTGGGGTTCG 
ACCGCCGCCT 
ATCCTCCAGC 
GCTTATAATG 
TCACTGCATT 
GCGGCCGC G A 
ATTGTTATCC 



TATGCATTTT 
TCTAGTTGCC 
GCCACTCCCA 
TGTCATTCTA 
AATAGCAGGC 
AGCTTTGCTT 
CCAATTCAGT 
CTCTGCACAG 
CCAGTGAGTG 
GTAGAGCCAC 
GGGCAGAGCA 
TGTGTTGGGA 
CCGGCCGCTT 
TCTGATGCCG 
GACCTGTCCG 
ACGACGGGCG 
CTGCTATTGG 
AAAGTATCCA 
CCATTCGACC 
CTTGTCGATC 
GCCAGGCTCA 
TGCTTGCCGA 
CTGGGTGTGG 
CTTGGCGGCG 
CAGCGCATCG 
AAATG AC CGA 
TCTATGAAAG 
GCGGGGATCT 
GTTACAAATA 
CTAGTTGTGG 
TCCCGTCGAG 
GCTCACAATT 



TATAAGACCA 
AGCCATCTGT 
CTGTCCTTTC 
TTCTGGGGGG 



ATacraaaaA 
CTCAATTTCT 
AGTTGATTGA 
ATAAGGACAA 
GCACAGCATT 
ACCTTGGTAA 
TATAAGGTGA 
GCTTGGATCG 
GGGTGGAGAG 
CCGTGTTCCG 
GTGCCCTGAA 
TTCCTTGCGC 
GCGAAGTGCC 
TCATGGCTGA 
AC C AAGCGAA 
AGGATGATCT 
AGGCGCGCAT 
ATATCATGGT 
CGGACCGCTA 
AATGGG CTG A 
CCTTCTATCG 
CCAAGCGACG 
GTTGGGCTTC 
CATGCTGGAG 
AAGCAATAGC 
TTTGTCCAAA 
AGCTTGGCGT 
CCACACAACA 



TGGGACTTTT 
TGTTTGCCCC 
CTAATAAAAT 
TGGGGTGGGG 
TOcoa-rcsGac 

TATTTGCATA 
GCAAATGCGT 
ACATTATTCA 
CTAGGGAGAA 
GGGC CAATCT 
GGTAGGATCA 
ATCCTCTATG 
GCTATTCGGC 
GCTGTCAGCG 
TGAACTGCAG 
AGC TGTGCTC 
GGGGCAGGAT 
TGCAATGCGG 
ACATCGCATC 
GGACGAAGAG 
GCCCGACGGC 
GGAAAATGGC 
TCAGGACATA 
CCGCTTCCTC 
CCTTCTTGAC 
CCCAACCTGC 
GGAATCGTTT 
TTCTTCGCCC 
ATCACAAATT 
CTCATCAATC 
AATCATGGTC 
TACGAGCCGG 
-72- 



GCTGGCTTTA 
TCCCCCGTGC 
GAGGAAATTG 
CAGGACAGCA 
TCTATGGAAC 
ATGAGAAAAA 
TGC C AAAAAG 
GAGGGAGTAC 
ATATGCTTGT 
GCTCACACAG 
GTTGCTCCTC 
GTTGAACAAG 
TATGACTGGG 
CAGGGGCGCC 
GACGAGGCAG 
GACGTTGTCA 
CTCCTGTCAT 
CGGCTGCATA 
GAGCGAGCAC 
CATCAGGGGC 
GAGGATCTCG 
CGCTTTTCTG 
GCGTTGGCTA 
GTGCTTTACG 
GAGTTCTTCT 
CATCACGAGA 
TCCGGGACGC 
ACCCCAACTT 
TCACAAATAA 
TATCTTATCA 
ATAGCTGTTT 
AAGCATAAAG 



GATCAGCCTC 
CTTCCTTGAC 
CATCGCATTG 
AGGGGGAGGA 
CAQCTGGGGC 
AAGGAAAATT 
GATGCTTTAG 
CCAGAGCTGA 
CATCACCGAA 
GATAGAGAGG 
ACATTTGCTT 
ATGGATTGCA 
CACAACAGAC 
CGGTTCTTTT 
CGCGGCTATC 
CTGAAGCGGG 
CTCACCTTGC 
CGCTTGATCC 
GTACTCGGAT 
TCGCGCCAGC 
TCGTGACCCA 
GATTCATCGA 
CCCGTGATAT 
GTATCGCCGC 
GAGCGGGACT 
TTTCGATTCC 
CGGCTGGATG 
GTTTATTGCA 
AGCATTTTTT 
TGTCTGGATC 
CCTGTGTGAA 
TGTAAAGCCT 



5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 
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GGGGTGCCTA 


ATGAGTGAGC 


TAACTCACAT 


TAATTGCGTT 


GCGCTCACTG 


CCCGCTTTCC 


7200 


AGTCGGGAAA 


CCTGTCGTGC 


CAGCTGCATT 


AATGAATCGG 


CCAACGCGCG 


GGGAGAGGCG 


7260 


GTTTGCGTAT 


TGGGCGCTCT 


TCCGCTTCCT 


CGCTCACTGA 


CTCGCTGCGC 


TCGGTCGTTC 


7320 




AGCQCTATCA 


aCTCACTCAA 


AaacaoTAAT 


ACOOTTATCC 


ACAOAATCA« 


7380 


GGGATAACGC 


AGGAAAGAAC 


ATGTGAGCAA 


AAGGCCAGCA 


AAAGGCCAGG 


AACCGTAAAA 




AGGCCGCGTT 


GCTGGCGTTT 


TTCCATAGGC 


TCCGCCCCCC 


TGACGAGCAT 


CACAAAAATC 




GACGCTCAAG 


TCAGAGGTGG 


CGAAACCCGA 


CAGGACTATA 


AAGATACCAG 






CTGGAAGCTC 


CCTCGTGCGC 


TCTCCTGTTC 


CGACCCTGCC 


GCTTACCGGA 


TACCTGTCCG 




CCTTTCTCCC 


TTCGGGAAGC 


GTGGCGCTTT 


CTCAATGCTC 


ACGCTGTAGG 






CGGTGTAGGT 


CGTTCGCTCC 


AAGCTGGGCT 


GTGTGCACGA 


ACCCCCCGTT 


CAGCCCGACC 


7740 


GCTGCGCCTT 


ATCCGGTAAC 




AGTCCAACCC 


GGTAAGACAC 


GACTTATCGC 


7800 


CACTGGCAGC 


AGCCACTGGT 


AACAGGATTA 


GCAGAGCGAG 


GTATGTAGGC 


GGTGCTACAG 


7860 


AGTTC TTGAA 


GTGGTGGCCT 


nHL 1 Awtjvjt- 1 


ACACTAGAAG 


GACAGTATTT 


GGTATCTGCG 


7920 


CTCTGCTGAA 


GCCAGTTACC 


TTCGGAAAAA 


GAGTTGGTAG 


CTCTTGATCC 


GGCAAACAAA 




CCACCGCTGG 


TAGCGGTGGT 


TTTTTTGTTT 


GCAAGCAGCA GATTACGCGC 


AGAAAAAAAG 


8040 


GATCTCAAGA 


AGATCCTTTG 




CGGGGTCTGA 


CGCTC AG TGG 


AACGAAAACT 




CACGTTAAGG 




ATGAGATTAT 


CAAAAAGGAT 


CTTCACCTAG 


ATCCTTTTAA 




ATTAAAAATG 


AAGTTTTAAA 


TCAATCTAAA 


GTATATATGA 


GTAAACTTGG 


TCTGACAGTT 




ACCAATGCTT 


AATCAGTGAG 


GCACCTATCT 


CAGCGATCTG 


TCTATTTCGT 


TCATCCATAG 


8280 


TTGC CTG AC T 


CCCCGTCGTG 


T AG AT AAC T A 


CGATACGGGA 


GGGCTTACCA 


TCTGGCCCCA 




GTGCTGCAAT 


GATACCGCGA 


GACCCACGCT 


CACCGGCTCC 


AGATTTATCA 


GCAATAAACC 




AGCCAGCCGG 


AAGGGCCGAG 


CGCAGAAGTG 


GTCCTGCAAC 


TTTATCCGCC 






CTATTAATTG 


TTGCCGGGAA 


GCTAGAGTAA 


GTAGTTCGCC 


AGTTAATAGT 


TTGCGCAACG 




TTGTTGCCAT 


TGCTACAGGC 


ATCGTGGTGT 


CACGCTCGTC 


GTTTGGTATG 


GCTTCATTCA 




GCTCCGGTTC 


CCAACGATCA 


AGGCGAGTTA 


CATGATCCCC 


CATGTTGTGC 


AAAAAAGCGG 




TTAGCTCCTT 


CGGTCCTCCG 


ATCGTTGTCA 


GAAGTAAGTT 


GGCCGCAGTG 


TTATCACTCA 




TGGTTATGGC 


AGCACTGCAT 


AATTCTCTTA 


CTGTCATGCC 


ATCCGTAAGA 


TGCTTTTCTG 


8760 


TGACTGGTGA 


GTACTCAACC 


AAGTCATTCT 


GAGAATAGTG 


TATGCGGCGA 


CCGAGTTGCT 


8820 


CTTGCCCGGC 


GTCAATACGG 


GATAATACCG 


CGCCACATAG 


CAGAACTTTA 


AAAGTGCTCA 


8380 


TCATTGGAAA 


ACGTTCTTCG 


GGGCGAAAAC 


TCTCAAGGAT 


CTTACCGCTG 


TTGAGATCCA 


8940 


GTTCGATGTA 


ACCCACTCGT 


GCACCCAACT 


GATCTTCAGC 


ATCTTTTACT 


TTCACCAGCG 


9000 
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TTTCTGGGTG AGCAAAAACA GGAAGGCAAA ATGCCGCAAA AAAGGGAATA AGGGCGACAC 90 60 

GGAAATGTTG AATACTCATA CTCTTCCTTT TTCAATATTA TTGAAGCATT TATCAGGGTT 9120 

5 ATTGTCTCAT GAGCGGATAC ATATTTGAAT GTATTTAGAA AAATAAACAA ATAGGGGTTC 9180 

CGCGCACATT TCCCCGAAAA GTGCCACCT 9209 

10 (4) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 bases 
15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

20 

(iii) HYPOTHETICAL: yes 

(iv) ANTI-SENSE: no 

25 (ix) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

5 ' ATC ACA GAT CTC TCA CCA TGG ATT TTC AGG TBC AGA TTA TCA GCT 52 
TC 3 ' 2 

30 

(5) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

35 (A) LENGTH: 30 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

40 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: yes 

(iv) ANTI-SENSE: yes 

45 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

5' TGC AGC ATC CGT ACG TTT GAT TTC CAG CTT 3' 30 

(6) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

55 



50 
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(A) LENGTH: 384 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

5 

(ii) MOLECULE TYPE: DNA (genomic) 
(ili) HYPOTHETICAL: yes 
10 (iv) ANTI-SENSE: no 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

15 ATG GAT TTT CAG GTG CAG ATT ATC AGC TTC CTG CTA ATC AGT GCT TCA GTC 51 

ATA ATG TCC AGA GGG CAA ATT GTT CTC TCC CAG TCT CCA GCA ATC CTG TCT 102 

GCA TCT CCA GGG GAG AAG GTC ACA ATG ACT TGC AGG GCC AGC TCA AGT GTA 153 

20 

AGT TAC ATC CAC TGG TTC CAG CAG AAG CCA GGA TCC TCC CCC AAA CCC TGG 204 

ATT TAT GCC ACA TCC AAC CTG GCT TCT GGA GTC CCT GTT CGC TTC AGT GGC 255 

25 AGT GGG TCT GGG ACT TCT TAC TCT CTC ACA ATC AGC AGA GTG GAG GCT GAA 3 06 

GAT GCT GCC ACT TAT TAC TGC CAG CAG TGG ACT AGT AAC CCA CCC ACG TTC 3 57 

GGA GGG GGG ACC AAG CTG GAA ATC AAA 384 



30 



35 



(7) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

- (A) LENGTH: 27 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: yes 

(iv) ANTI-SENSE: no 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

5 • GCG GCT CCC ACG CGT GTC CTG TCC CAG 3 ' 
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(8) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: yes 

(iv) ANTI-SENSE: yes 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

5' GGS TGT TGT GCT AGC TGM RGA GAC RGT GA 3 ' 29 

(9) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 420 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: yes 

(iv) ANTI-SENSE: no 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



ATG 


GGT 


TGG 


AGC 


CTC 


ATC 


TTG 


CTC 


TTC 


CTT 


GTC 


GCT 


GTT 


GCT 


ACG 


CGT 


GTC 


51 


CTG 


TCC 


CAG 


GTA 


CAA 


CTG 


CAG 


CAG 


CCT 


GGG 


GCT 


GAG 


CTG 


GTG 


AAG 


CCT 


GGG 


102 


GCC 


TCA 


GTG 


AAG 


ATG 


TCC 


TGC 


AAG 


GCT 


TCT 


GGC 


TAC 


ACA 


TTT 


ACC 


AGT 


TAC 


153 


AAT 


ATG 


CAC 


TGG 


GTA 


AAA 


CAG 


ACA 


CCT 


GGT 


CGG 


GGC 


CTG 


GAA 


TGG 


ATT 


GGA 


204 


GCT 


ATT 


TAT 


CCC 


GGA 


AAT 


GGT 


GAT 


ACT 


TCC 


TAC 


AAT 


CAG 


AAG 


TTC 


AAA 


GGC 


255 


AAG 


GCC 


ACA 


TTG 


ACT 


GCA 


GAC 


AAA 


TCC 


TCC 


AGC 


ACA 


GCC 


TAC 


ATG 


CAG 


CTC 


305 


AGC 


AGC 


CTG 


ACA 


TCT 


GAG 


GAC 


TCT 


GCG 


GTC 


TAT 


TAC 


TGT 


GCA 


AGA 


TCG 


ACT 


357 


TAC 
ACC 


TAC 
GTC 


GGC 
TCT 


GGT 
GCA 


GAC 


TGG 


TAC 


TTC 


AAT 


GTC 


TGG 


GGC 


GCA 


GGG 


ACC 


ACG 


GTC 


403 
420 
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